CDS
Accession Number | TCMCG022C41458 |
gbkey | CDS |
Protein Id | XP_039170811.1 |
Location | join(8949528..8949702,8949788..8949870,8950186..8950234,8950237..8950279,8950699..8950802,8951148..8951317,8951419..8951533,8951932..8951987,8952076..8952112,8952298..8952398,8952777..8952839,8952922..8952984,8953102..8953184,8953359..8953467,8953567..8953620,8954270..8954357,8954731..8954768,8955329..8955434,8955628..8955794,8955918..8955977) |
Gene | LOC104448212 |
GeneID | 104448212 |
Organism | Eucalyptus grandis |
Protein
Length | 588aa |
Molecule type | protein |
Topology | linear |
Data_file_division | PLN |
dblink | BioProject:PRJNA698663 |
db_source | XM_039314877.1 |
Definition | LOW QUALITY PROTEIN: imidazole glycerol phosphate synthase hisHF, chloroplastic [Eucalyptus grandis] |
EGGNOG-MAPPER Annotation
COG_category | E |
Description | belongs to the HisA HisF family |
KEGG_TC | - |
KEGG_Module |
M00026
[VIEW IN KEGG] |
KEGG_Reaction |
R04558
[VIEW IN KEGG] |
KEGG_rclass |
RC00010
[VIEW IN KEGG] RC01190 [VIEW IN KEGG] RC01943 [VIEW IN KEGG] |
BRITE |
ko00000
[VIEW IN KEGG] ko00001 [VIEW IN KEGG] ko00002 [VIEW IN KEGG] ko01000 [VIEW IN KEGG] |
KEGG_ko |
ko:K01663
[VIEW IN KEGG] |
EC | - |
KEGG_Pathway |
ko00340
[VIEW IN KEGG] ko01100 [VIEW IN KEGG] ko01110 [VIEW IN KEGG] ko01230 [VIEW IN KEGG] map00340 [VIEW IN KEGG] map01100 [VIEW IN KEGG] map01110 [VIEW IN KEGG] map01230 [VIEW IN KEGG] |
GOs | - |
Sequence
CDS: ATGGAGGCGGCGCCGTTCGGCTCGGCATTCACATCTAGATCGCTCTCCCCACCCTCTTCATCGTCTTCCACCTCGCAATCCCTCCTTTGCTTCCATTTCAACAAGGCCCGCCTCAAGCTCAAATCGCCCAGAACCTTCGCCGTTCGCGCCGCCGCCGCTCGGGGAGGAGATTCTGTGGTGACTCTGCTTGATTATGGTGCCGGCAACGTCCGTAGCGTCCGGAATGCTATTCGCCACCTCGGCTTCGACATTAAAGATGTGCAAACTCCTGAAGATATTTTGGGTGCAAATCGCCTCATCTTCCCGGGTGGGGCATTTGCTTCTGCCATGGATGTGTTGAATAAGAAAGGGATGGCTGAAGCTCTCTGCACTTATATTATGGAAGATCGCCCCTTCCTAGGCATATGTCTTGGACTGCAACTACTCTTTGAGTCAAGTGAGGAGAATGGCCCAGTCCGTGGTCTTGGAATAATCCCTGGAGTGGTTGGTCGCTTTGATGCATCCAGTGGTTTGAGAGTGCCTCACATTGGCTGGAATGCCTTGAAAGTTAGAGAAGGCTCTGAAATTTTGGATGATATTGGAAATCACCATGTCTATTTTGTTCACTCCTACCGTGCCCTACCAACAAACGACAACATGGATTGGGTTTCCTCTACCTGCAATTATGGCGACAATTTCATAGCCTCTGTAGGGAAGGGAAATGTGCATGCAGTACAATTTCACCCAGAAAAAAGTGGAGATGTGGGTCTGACGGTATTGAGAAGATTCTTGTCGCCAAAGTCACATGTAACAGAGAAGCCTACAGAAGGGAATGCCTCAAAGCTTGCTAAAAAGGTAATTGCTTGTCTTGATGTGAGGGCAAACGACAAGGGAGATCTTGTTGTTACCAAAGGCGACCAATATGATGTGAGAGAGCGCACAGAAGAGAATGAGGTGAGGAACCTCGGTAAGCCTGTGGACCTAGCTGGGCAGTATTATAAGGATGGAGCAGATGAAGTCAGTTTTCTGAATATTACTGGTTTCCGCGACTTCCCCCTGGGTGACTTACCGATGCTGCAGGTATTGAGATACACATCAGAAAATGTTTTTGTACCACTAACTGTTGGAGGTGGAATTAGAGATTTCACGGACGCAAATGGCAGGTACTATTCCAGTCTAGAAGTGGCTTCAGAATATTTTAGATCTGGGGCGGATAAGGTTTCTATAGGCAGTGATGCGGTTTATGCTGCTGAAGAATATCTAAGAACCGGAGTGAAGAGTGGAAAGAGCAGCTTAGAACAGATATCTAGAGTCTATGGGAATCAGGCAGTGGTAGTGAGCATAGATCCTCGTAGGATGTACATCAAGAGTCCCGAAGATGTGGAGTTCAGATCTACAAGGGTAACAAATCCAGGTCCAAATGGAGAAGAATATGCTTGGTACCAGTGCACGGTCAATGGAGGACGAGAAGGTCGGCCAATTGGAGCTTACGAGCTTGCAAAGGCTGTTGAAGATTTGGGTGCTGGAGAGATATTGCTTAACTGCATTGACTGTGATGGTCAAGGAAAAGGATTCGATGTCGATTTAGTGAAGCTGATTTCTGATGCCGTGAGCATCCCAGTGATAGCAAGTAGCGGTGCGGGCTGTGTGGAGCACTTTACGGAGGTATTTGAGAAGACCAATGCATCCGCTGCGCTTGCTGCAGGGATATTCCACCGGAAGGAGGTGCCGATTCAGGCTGTGAAGGAGCACTTGCTAAAGGAAGGCATAGAAGTAAGAATCTAG |
Protein: MEAAPFGSAFTSRSLSPPSSSSSTSQSLLCFHFNKARLKLKSPRTFAVRAAAARGGDSVVTLLDYGAGNVRSVRNAIRHLGFDIKDVQTPEDILGANRLIFPGXGAFASAMDVLNKKGMAEALCTYIMEDRPFLGICLGLQLLFESSEENGPVRGLGIIPGVVGRFDASSGLRVPHIGWNALKVREGSEILDDIGNHHVYFVHSYRALPTNDNMDWVSSTCNYGDNFIASVGKGNVHAVQFHPEKSGDVGLTVLRRFLSPKSHVTEKPTEGNASKLAKKVIACLDVRANDKGDLVVTKGDQYDVRERTEENEVRNLGKPVDLAGQYYKDGADEVSFLNITGFRDFPLGDLPMLQVLRYTSENVFVPLTVGGGIRDFTDANGRYYSSLEVASEYFRSGADKVSIGSDAVYAAEEYLRTGVKSGKSSLEQISRVYGNQAVVVSIDPRRMYIKSPEDVEFRSTRVTNPGPNGEEYAWYQCTVNGGREGRPIGAYELAKAVEDLGAGEILLNCIDCDGQGKGFDVDLVKLISDAVSIPVIASSGAGCVEHFTEVFEKTNASAALAAGIFHRKEVPIQAVKEHLLKEGIEVRI |